Contour vs Non-Contour based Word Segmentation from Handwritten Text Lines- an Experimental Analysis

نویسندگان

  • Fajri Kurniawan
  • Amjad Rehman
  • Dzulkifli Bin Mohamad
چکیده

This paper compares contour based and noncontours based techniques for extracting words from unconstrained handwritten text lines. Proposed novel approach is based on contours of the words rather only considering threshold for inter-word gaps as previous studies. In this approach, contour of each word is examined along with threshold for inter-word gaps to extract words with high confidence. Unlike previous studies, preprocessing technique is not applied, that enhance the speed significantly. Furthermore, a simple technique for punctuation detection is proposed to increase accuracy of word extraction. For fair comparison text lines are taken randomly from IAM benchmark database and threshold calculation is kept same for all techniques. Experiments thus performed, exhibit improved results and speed over the conventional word extraction methods. Furthermore, developed techniques and results are compared with the other approaches available in the literature using same benchmark database.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Statistical Approach for Segmenting Unconstrained Handwritten Text lines

The segmentation of unconstrained handwritten text lines into words is an important stage in word recognition systems. This paper addresses a methodology to overcome the challenges, which are amplified by the non-uniform spaces between words and overlapping components by using a few statistical approaches. The system was developed using Java 2 and ImageJ tool. In this approach, a text line imag...

متن کامل

A Modified Character Segmentation Algorithm for Farsi Printed Text Using Upper Contour Labelling

In this paper, a modified segmentation algorithm for printed Farsi words is presented. This algorithm is based on a previous work by Azmi that uses the conditional labeling of the upper contour to find the segmentation points. The main objective is to improve the segmentation results for low quality prints. To achieve this, various modifications on local baseline detection, contour labeling an...

متن کامل

A Modified Character Segmentation Algorithm for Farsi Printed Text Using Upper Contour Labelling

In this paper, a modified segmentation algorithm for printed Farsi words is presented. This algorithm is based on a previous work by Azmi that uses the conditional labeling of the upper contour to find the segmentation points. The main objective is to improve the segmentation results for low quality prints. To achieve this, various modifications on local baseline detection, contour labeling an...

متن کامل

Region growing based segmentation algorithm for typewritten and handwritten text recognition

This paper presents a new technique of high accuracy to recognize both typewritten and handwritten English and Arabic texts without thinning. After segmenting the text into lines (horizontal segmentation) and the lines into words, it separates the word into its letters. Separating a text line (row) into words and a word into letters is performed by using the region growing technique (implicit s...

متن کامل

Characters Segmentation of Cursive Handwritten Words based on Contour Analysis and Neural Network Validation

This paper presents a robust algorithm to identify the letter boundaries in images of unconstrained handwritten word. The proposed algorithm is based on vertical contour analysis. Proposed algorithm is performed to generate presegmentation by analyzing the vertical contours from right to left. The unwanted segmentation points are reduced using neural network validation to improve accuracy of se...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • JDCTA

دوره 3  شماره 

صفحات  -

تاریخ انتشار 2009